Communication in Multicomputers with Nonconvex Faults
نویسندگان
چکیده
Enhancingcurrentmulticomputer routers for fault-tolerant routing with modest increase in routing complexity and resource requirements is addressed. The proposed method handles solid faults in meshes, which includes all convex faults and many practical nonconvex faults, for example, faults in the shape of L or T. As examples of the proposed method, adaptive andnonadaptive fault-tolerant routing algorithmsusing four virtual channels per physical channel are described.
منابع مشابه
Fault-Tolerance in Augmented Hypercube Multicomputers
This paper describes different schemes for tolerating faults in augmented hypercube multiprocessors. The architectures considered have a spare assigned to each subset of nodes (cluster). The approaches make use of hardware redundancy in the form of spare nodes and/or links and usually requires modifications in the communication as well as computation algorithms.
متن کاملAdaptive Fault-Tolerant Routing in Cube-Based Multicomputers Using Safety Vectors
Reliable communication in cube-based multicomputers using the safety vector concept is studied in this paper. In our approach, each node in a cube-based multicomputer of dimension n is associated with a safety vector of n bits, which is an approximated measure of the number and distribution of faults in the neighborhood. The safety vector of each node can be easily calculated through n 1 rounds...
متن کاملFault Tolerant Multicast Communication in Multicomputers
We describe fault tolerant routing of multi cast messages in mesh based wormhole switched multicom puters With the proposed techniques multiple convex faults can be tolerated The fault information is kept locally each fault free processor needs to know the status of the links incident on it only Furthermore the proposed tech niques are deadlock and livelock free and guarantee deliv ery of messa...
متن کاملFault-Tolerant Multicast Communication for Multicomputers
We describe fault-tolerant routing of multi-cast messages in mesh-based wormhole-switched multicom-puters. With the proposed techniques, multiple convex faults can be tolerated. The fault information is kept locally| each fault-free processor needs to know the status of the links incident on it only. Furthermore, the proposed techniques are deadlock-and livelock-free and guarantee delivery of m...
متن کاملFault-Tolerant Communication with Partitioned Dimension-Order Routers with Complex Faults
ÐThe current fault-tolerant routing methods require extensive changes to practical routers such as the Cray T3D's dimension-order router to handle faults. In this paper, we propose methods to handle faults in multicomputers with dimension-order routers with simple changes to router structure and logic. Our techniques can be applied to current implementations in which the router is partitioned i...
متن کامل